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e-j . Abstract 

We consider an isolated, macroscopic quantum system. Let Jif be a micro- 
canonical "energy shell," i.e., a subspace of the system's Hilbert space spanned 
by the (finitely) many energy eigenstates with energies between E and E + 5E. 
q-i The thermal equilibrium macro-state at energy E corresponds to a subspace Jrf? eq 

of ffl such that dimJ4? eq / dim is close to 1. We say that a system with state 
vector ip G Jf? is in thermal equilibrium if tp is "close" to ^f eq . We show that for 
"typical" Hamiltonians with given eigenvalues, all initial state vectors evolve 
■ in such a way that ipt is in thermal equilibrium for most times t. This result is 

[ — . closely related to von Neumann's quantum ergodic theorem of 1929. 



PACS: 05.30.-d; 03.65.-w. Key words: equilibration/thermalization of macro- 
scopic quantum systems, generic/typical Hamiltonian. 



1 Introduction 

If a hot brick is brought in 
isolated, then energy will flow from the hot to the cold brick until their temperatures 



X ... 

If a hot brick is brought in contact with a cold brick, and the two bricks are otherwise 
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become equal, i.e., the system equilibrates. Since the bricks ultimately consist of elec- 
trons and nuclei, they form a quantum system with a huge number (> 10 20 ) of particles; 
this is an example of an isolated, macroscopic quantum system. 

From a microscopic point of view the state of the system at time t is described by a 
vector 

i;(t) = e-^V(O) (1) 
in the system's Hilbert space or a density matrix 

p(t) = e- lHt p(0)e iHt , (2) 

where H is the Hamiltonian of the isolated system and we have set h — 1. In this 
paper we prove a theorem asserting that for a sufficiently large quantum system with a 
"typical" Hamiltonian and an arbitrary initial state ip(0), the system's state ip(t) spends 
most of the time, in the long run, in thermal equilibrium. (Of course, before the system 
even reaches thermal equilibrium there could be a waiting time longer than the present 
age of the universe.) This implies the same behavior for an arbitrary p(0). 

This behavior of isolated, macroscopic quantum systems is an instance of a phe- 
nomenon we call normal typicality [5 J , a version of which is expressed in von Neumann's 
quantum ergodic theorem [T7| . However, our result falls outside the scope of von Neu- 
mann's theorem, because of the technical assumptions made in that theorem. Our result 
also differs from the related results in jU [T51 [TBI [121 EH ED], which use different notions 
of when a system is in an equilibrium state. In particular they do not regard the ther- 
mal equilibrium of an isolated macroscopic system as corresponding to its wave function 
being close to a subspace Jif eq of Hilbert space. See Section 6 for further discussion. 

The rest of this paper is organized as follows. In the remainder of Section 1, we 
define more precisely what we mean by thermal equilibrium. In Section 2 we outline 
the problem and our result, Theorem 1. In Section 3 we prove the key estimate for the 
proof of Theorem 1. In Section 4 we describe examples of exceptional Hamiltonians, 
illustrating how a system can fail to ever approach thermal equilibrium. In Section 5 
we compare our result to the situation with classical systems. In Section 6 we discuss 
related works. 

1.1 The equilibrium subspace 

Let J^totai be the Hilbert space of a macroscopic system in a box A, and let H be its 
Hamiltonian. Let {4> a } be an orthonormal basis of ^totai consisting of eigenvectors of H 
with eigenvalues E a . Consider an energy interval [E, E + 5E], where SE is small on the 
macroscopic scale but large enough for the interval [E, E + SE] to contain very many 
eigenvalues. Let J$? C J^otai be the corresponding subspace, 

Jt? = span{0 Q : E a G [E, E + 5E] } . (3) 

A subspace such as M 3 is often called a micro- canonical energy shell. Let D be the 
dimension of M~ i.e., the number of energy levels, including multiplicities, between E 
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and E + 8E. In the following we consider only quantum states tp that lie in J^, i.e., of 
the form 

tp = ^2c a (p a (4) 

a 

with c a 7^ only for a such that E a G [E, E + SE]. 

According to the analysis of von Neumann [T7J dB] and others (cf. [6]), the macro- 
scopic (coarse-grained) observables in a macroscopic quantum system can be naturally 
"rounded" to form a set of commuting operators, 

•MM, ■ ,• (5) 

The operators are defined on J^totab but since we can take them to include (and thus 
commute with) a coarse-grained Hamiltonian, we can (and will) take them to commute 
with the projection to Jif, and thus to map J4? to itself. We write v = (m 1; . . . , m^) for 
a list of eigenvalues rrii of the restriction of Mj to and for the joint eigenspace. 
Such a set of operators generates an orthogonal decomposition of the Hilbert space 

^ = 0X, (6) 

V 

where each J^,, called a macro-space, represents a macro-state of the system. The 
dimension of M? v is denoted by d u ; note that ^j V d v = D . If any .ffl v has dimension 0, 
we remove it from the family {J%,}. In practice, d v 3> 1, since we are considering a 
macroscopic system with coarse-grained observables. 

It can be shown in many cases, and is expected to be true generally, that among 
the macro-spaces M'y there is a particular macro-space one corresponding to 

thermal equilibrium, such that 

d eq /D w 1 , (7) 

indeed with the difference 1 — d eq /D exponentially small in the number of particles!]] 
This implies, in particular, that each of the macro-observables Mi is "nearly constant" 
on the energy shell in the sense that one of its eigenvalues has multiplicity at least 
d eq ~ D. We say that a system with quantum state ip (with ||^|| = 1) is in thermal 
equilibrium if if} is very close (in the Hilbert space norm) to J4? eq , i.e., if 

{^\P eq \^)^l, (8) 

where P eq is the projection operator to Jt? eq . The corresponding relation for density 
matrices is 

Tir(iV)«l. (9) 



lr This dominance of the equilibrium state can be expressed in terms of the (Boltzmann) entropy 
S v of a macroscopic system in the macro-state is, be it the equilibrium state or some other (see [S]), 
defined as S v — fc_B logdi,, where is the Boltzmann constant: d eq /D being close to 1 just expresses 
the fact that the entropy of the equilibrium state is close to the micro-canonical entropy S mc , i.e., 

Seq = k B l0g4g ~ k B log D = S mc . 
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The condition (jSJ) implies that a quantum measurement of the macroscopic observable Mj 
on a system with wave function ip will yield, with probability close to 1, the "equilibrium" 
value of Mj. Likewise, a joint measurement of Mi, . . . , Mf. will yield, with probability 
close to 1, their equilibrium values. 

Let n(dip) be the uniform measure on the unit sphere in [111 [19]. It follows from 
(J7j) that most ip relative to fi are in thermal equilibrium. Indeed, 

/X d 
(i>\P eg \i>) M#) = ^Tr P eq = -jL « 1 . (10) 

Since the quantity (ip\P eq \il>) is bounded from above by 1, most ip must satisfy (JSj) JHI 



1.2 Examples of equilibrium subspaces 

To illustrate the decomposition into macro-states, we describe two examples. As Exam- 
ple 1, consider a system composed of two identical subsystems designated 1 and 2, e.g., 
the bricks mentioned in the beginning of this paper, with Hilbert space ^totai = 
The Hamiltonian of the total system is 

H = H 1 + H 2 + XV, (11) 

where Hi and H 2 are the Hamiltonians of subsystems 1 and 2 respectively, and AV^ is 
a small interaction between the two subsystems. We assume that Hi, H 2 , and H are 
positive operators. Let J4? be spanned by the eigenfunctions of H with energies between 
E and E + SE. 

In this example, we consider just a single macro-observable M, which is a projected 
and coarse-grained version of Hi/E, i.e., of the fraction of the energy that is contained 
in subsystem 1 alone. We cannot take M to simply equal Hi/E because Hi is defined 
on J^totab n °t 3%* 1 an d will generically not map ffl to itself, while we would like M to be 
an operator on J^. To obtain an operator on Jtf, let P be the projection J#t tai — > 
and set 

H'i = PHiP (12) 

(more precisely, H[ is PH\ restricted to J^ 5 ). Note that H[ is a positive operator, but 
might have eigenvalues greater than E. Now defin^f] 

M = f(H[/E) (13) 

with the coarse-graining function 

'0 if x< 0.01, 
0.02 ifx 6 [0.01,0.03), 
K ' I 0.04 if x e [0.03,0.05), v s 

k etc. 

2 It should in fact be true for a large class of observables A on ^ that, for most ip relative to /i, 
(^|j4|^i) ~ Ti(p mc A), where p mc is the micro-canonical density matrix, i.e., 1/D times the identity on 
. This is relevant to the various results on thermalization described in Section [6l 
3 Recall that the application of a function / to a self-adjoint matrix A is defined to be f(A) = 
/( a a)l < / ? a)( ( ^a| if the spectral decomposition of A reads A = a a \ip a )(tp a \. 
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The M'y are the eigenspaces of M; clearly, ® v ^ v = Jtf '■ If, as we assume, XV is small, 
then we expect ^0.5 = Jt? eq to have the overwhelming majority of dimensions. In a 
thorough treatment we would need to prove this claim, as well as that H[ is not too 
different from Hi, but we do not give such a treatment here. 

As Example 2, consider N bosons (fermions) in a box A = [0,L] 3 C M 3 ; i.e., J^totai 
consists of the square-integrable (anti-)symmetric functions on A^. Let the Hamiltonian 
be 

1 - 

# = -^E v * + E^-^i), as) 

i=l i<j 

where the Laplacian has Dirichlet boundary conditions, v(r) is a given pair potential, 
and q i is the triple of position coordinates of the i-th particle. Let again be spanned 
by the eigenfunctions with energies between E and E + 5E. 

In this example, we consider again a single macro-observable M, based on the oper- 
ator A^ieft for the number of particles in the left half of the box A: 

AW(<7i, • • • , Qn) = #{* = Qi e [0, L/2] x [0, L] 2 } ^{q x , ...,q N ). (16) 

Note that the spectrum of A^ e ft consists of the AT + 1 eigenvalues 0, 1, 2, . . . , N . To 
obtain an operator on ffi ', let P be the projection J^t ota i — > Jf? and set A^ eft = PNi e f t P. 
Note that the spectrum of N( cft is still contained in [0, AT]. Now define M = f(N{ e{t /N) 
with the coarse-graining function (1141) . We expect that for large N, the eigenspace with 
eigenvalue 0.5, J$? eq = J#o5, has the overwhelming majority of dimensions (and that 



2 Formulation of problem and results 

Our goal is to show that, for typical macroscopic quantum systems, 

(ip{t)\P eq \^{t)) w 1 for most t. (17) 

To see this, we compute the time average of (ip(t)\P eq \il>(t)) . We denote the time average 
of a time-dependent quantity f(t) by a bar, 



f(t)= lim i f dtf(t). 



Since (if)(t)\P eq \if)(t)) is always a real number between and 1, it follows that if its time 
average is close to 1 then it must be close to 1 most of the time. Moreover, for //-most 
^(0), where /x is the uniform measure on the unit sphere of Jif, ij)(t) is in thermal 
equilibrium most of the time. This result follows from Fubini's theorem (which implies 
that taking the /i-average commutes with taking the time average) and the unitary 
invariance of fx: 



Mt)|Pe,hK*)M#) = (Me im P eq e- im \iP) m#) = /(viWM#)~i- (19) 
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That is, the ensemble average of the time average is near 1, so, for //-most ip(0), the 
time average must be near 1, which implies our claim above. So the interesting question 
is about the behavior of exceptional ijj(0), e.g., of systems which are not in thermal 
equilibrium at t = 0. Do they ever go to thermal equilibrium? 

As we will show, for many Hamiltonians the statement (fl71) holds in fact for all 
ip(0) G M 3 . From now on, let H denote the restriction of the Hamiltonian to and let 
0i, ...,</>£) be an orthonormal basis of consisting of eigenvectors of the Hamiltonian 
H with eigenvalues E%, . . . , E D . If 

D 

Ca a j C Q (0a \m) (20) 

a=l 

then 

a=l 

Thus, 

D 

s©ra^®y= E cpmpm ■ (22) 

a,/3=l 

If H is non-degenerate (which is the generic case) then E a — Ep vanishes only for a — f3, 
so the time averaged exponential is 5 a /3, and 

D 
a=l 

Thus, for the system to be in thermal equilibrium most of the time it is necessary and 
sufficient that the right hand side of fl23|) is close to 1. 

Now if an energy eigenstate <p a is not itself in thermal equilibrium then when "0(0) = 
<p a the system is never in thermal equilibrium, since this state is stationary. Conversely, 
if we have that 

((p a \P eq \(pa) ~ 1 for all a, (24) 

then the system will be in thermal equilibrium most of the time for all ip(0). This follows 
directly from ( |23i) since the right hand side of ( 1231) is an average of the {4> a \P e q\4> a ) ■ We 
show below that ( 1241) is true of "most" Hamiltonians, and thus, for "most" Hamiltonians 
it is the case that every wave function spends most of the time in thermal equilibrium. 

2.1 Main result 

The measure of "most" we use is the following: for any given D (distinct) energy 
values Ei, . . . , E^, we consider the uniform distribution /iHam over all Hamiltonians 
with these eigenvalues. Choosing H at random with distribution /^Ham is equivalent 
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to choosing the eigenbasis {<f) a } according to the uniform distribution honb over all 
orthonormal bases of Jf, and setting H = ^2 a E a \(p a ){(f) a \. The measure Honb can be 
defined as follows: Choosing a random basis according to honb amounts to choosing 0i 
according to the uniform distribution over the unit sphere in Jf, then (f>2 according to 
the uniform distribution over the unit sphere in the orthogonal complement of 0i, etc. 
Alternatively, Honb can be defined in terms of the Haar measure Hu(d) on the group 
U(D) of unitary D x D matrices: any given orthonormal basis {Xa} of M 3 defines a 
one-to-one correspondence between U(D) and the set of all orthonormal bases of J^ 5 , 
associating with the matrix U = {U a p) G U(D) the basis 

D 

<fra = ^ U °<PXP ' ( 25 ) 
p=l 

the image of the Haar measure under this correspondence is in fact independent of the 
choice of {xp} (because of the invariance of the Haar measure under right multiplication), 
and is Honb- 

Put differently, the ensemble /iHam of Hamiltonians can be obtained by starting from 
a given Hamiltonian Ho on M 3 (with distinct eigenvalues E\, . . . , Ed) and setting 

H = UHoU- 1 (26) 

with U a random unitary matrix chosen according to the Haar measure. Note that, 
while considering different possible Hamiltonians H in we keep J^ eq fixed, although 
in practice it would often be reasonable to select J^ eq in a way that depends on H (as 
we did in the examples of Section [L2|) . 

For our purpose it is convenient to choose the basis {Xa} in such a way that the first 
d eq basis vectors lie in Jif eq and the other ones are orthogonal to Jtf^q- Then, we have 
that 



'a 



Pe q \<Pa) = J2\ U ^\ 2 ( 2? ) 

(3=1 

with U a /3 the unitary matrix satisfying (1251) . 

We will show first, in Lemma 1, that for every < e < 1, if D is sufficiently large 
and d eq /D sufficiently close to 1, then most orthonormal bases {<f) a } are such that 

(<Pa\P eq \<Pa) >l-e for all a . (28) 

This inequality is a precise version of ([M])- How close to 1 should d eq /D be? The fact 
that the average of (ip\P eq \ip) over all wave functions ip on the unit sphere of equals 
d eq /D, mentioned already in (fTUj) . implies that (j28l) cannot be true of most orthonormal 
bases if d eq / D < 1 — e. To have enough wiggling room, we require that 

%>1-|. (29) 
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We will show then, in Theorem 1, that for every (arbitrarily small) < rj < 1 and for 
sufficiently large D, most H are such that for every initial wave function ^(0) € Jif with 
H^O) || = 1, the system will spend most of the time in thermal equilibrium with accuracy 
l — i], where we say that a system with wave function ip is in thermal equilibrium with 
accuracy 1 — rj if 

{^\P eq \^)>l-r ] . (30) 

This inequality is a precise version of (jSJ). In order to have no more exceptions in time 
than the fraction < 5' < 1, we need to set the e in (1251) and (J2"5j) equal to rj5'. 

Lemma 1. Let Hu{d) denote the Haar measure on U(D), and 

y £ := [U e U[D) Ma : \ U ^\ 2 > 1 ~ £ } • ( 31 ) 

13=1 

Then for all < e < 1 and < 5 < 1, there exists D = D (e, 5) > such that 

if D> D and d eq > (1 - e/2)D then /i U (d)(^s) >l-S. (32) 

The proof of Lemma [1] is given in Section [3j It also shows that Do can for example 
be chosen to be 

D {e, S) = maxjioV 2 log(4/5), 10 6 e" 4 ) . (33) 

From (1271). we obtain: 



Theorem 1. For all r],S,8' G (0, 1), all integers D > D (j]8',S) and all integers d eq > 
(l—i]5'/2)D the following is true: Let J$? be a Hilbert space of dimension D; let J$? eq be a 
subspace of dimension d eq ; let P eq denote the projection to J4? eq ; let E\, . . . , Ed be pairwise 
distinct but otherwise arbitrary; choose a Hamiltonian at random with eigenvalues E a 
and an eigenbasis 4> a that is uniformly distributed. Then, with probability at least 1 — 5, 
every initial quantum state will spend (1 — 5') -most of the time in thermal equilibrium 
as defined in ( |30|) . i.e., 



lim inf — 



{0<t<T:(^(t)\PeMt))> 1 -v} >l-S', (34) 



where \M\ denotes the size (Lebesgue measure) of the set M. 

Proof. It follows from Lemma [1] that under the hypotheses of Theorem [H 



m)\Pe q \m)> 1 -^' 



with probability at least 1 — 5. Thus, since r)5' > 1 — (ijj(t)\P eq \^(t)) > r}5, where 5 is 
the limsup^^oo of the fraction of the time in (0,T) for which (ip(t)\P eq \il)(t)) < 1 — r), it 
follows that 5 < 5'. □ 
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2.2 Remarks 



Normal typicality. Theorem [T] can be strengthened; with the same sense of "most" as 
in Theorem [TJ we have that for most Hamiltonians and for all if)(0) 

(^(t)|P^(t))«^^, for all v (35) 

for most t. For v = eq, this implies that (ip(t)\P eq \ip(t)) ~ 1. This stronger statement 
we have called normal typicality [5]. A version of normal typicality was proven by von 
Neumann [17]. However, because of the technical assumptions he made, von Neumann's 
result, while much more difficult, does not quite cover the simple result of this paper. 

Typicality and probability. When we express that something is true for most H or 
most ip relative to some normalized measure p, it is often convenient to use the language 
of probability theory and speak of a random H or ip chosen with distribution p. However, 
by this we do not mean to imply that the actual H or ip in a concrete physical situation 
is random, nor that one would obtain, in repetitions of the experiment or in a class of 
similar experiments, different if's or ip's whose empirical distribution is close to p. That 
would be a misinterpretation of the measure p, one that suggests the question whether 
perhaps the actual distribution in reality could be non-uniform. This question misses 
the point, as there need not be any actual distribution in reality. Rather, Theorem 1 
means that the set of "bad" Hamiltonians has very small measure /^Ham- 

Consequences for Example 2. From Lemma 1 it follows for Example 2 that typical 
Hamiltonians of the form (I26I) with Hq given by the right hand side of (fl5|) are such 
that all eigenfunctions are close to J#o.5, this fact in turn strongly suggests (though 
we have not proved this) that the eigenfunctions are essentially concentrated on those 
configurations that have approximately 50% of the particles in the left half and 50% in 
the right half of the box. 

Equilibrium Statistical Mechanics. Theorem 1 implies that, for typical H, every 
ip(0) G Jif is such that for most t, 

(m\Mi\m)-^(pm C M t ), (36) 

where p mc is the standard micro-canonical density matrix (i.e., 1/D times the projection 
^totai — ► for all macro-observables Mj as described in Section [LT1 This justifies 
replacing \if)(t))(i/j(t)\ by p mc as far as macro-observables in equilibrium are concerned. 
However, this does not, by itself, justify the use of p mc for observables A not among the 
{Mi}. For example, consider a microscopic observable A that is not "nearly constant" 
on the energy shell J^. Then, standard equilibrium statistical mechanics tells us to use 
p mc for the expected value of A in equilibrium. We believe that this is in fact correct for 
most such observables, but it is not covered by Theorem 1. Results concerning many 
such observables are described in Section [6j These results, according to which, in an 
appropriate sense, 

(m\A\m)~MpmcA) (37) 
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for suitable A and ip(0), are valid only in quantum mechanics. The justification of the 
broad use of p mc in classical statistical mechanics relies on rather different sorts of results 
requiring different kinds of considerations. 

3 Proof of Lemma U 

Proof. Let us write P for the Haar measure fiu(D), and let 

D d eq 

p :=p (n{x>^i 2 >i-^). (38) 



a=l p=l 



Observe that 



p=l-F(\j{j^\U a p\ 2 <l-e}) (39) 

a=l (3=1 

> l-DmaxP|^|f/ Q/3 | 2 < (40) 

P=l 

Since U = (U a p) is a random unitary matrix with Haar distribution, its a-th column is 
a random unit vector U := {U a p)p whose distribution is uniform over the unit sphere 
of C D (i.e., the distribution is, up to a normalizing constant, the surface area measure). 
Therefore, the probability in the last line does not, in fact, depend on a, and so the step 
of taking the maximum over a can be omitted. 

A random unit vector such as U can be thought of as arising from a random Gaussian 
vector G by normalization: Let Gp for (3 = 1, . . . , D be independent complex Gaussian 
random variables with mean and variance E|G/3| 2 = 1/D; i.e., KeGp and ImGg are 
independent real Gaussian random variables with mean and variance 1/2D. Then the 
distribution of G = (Gi, . . . , Gd) is symmetric under rotations from U(D), and thus 

G D 

U in distribution, with ||G|| 2 = \Gp\ 2 . (41) 

p=i 



\\G\\ 

We thus have that 



P>1-Dp(V^<1-4. (42) 
p=~i \\G\\ 2 ~ 

To estimate the probability on the right hand side of (142!) . we introduce three different 
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events: 



A( V ') := {\\\Gf - 1\ < V '} , (43) 

:={(!" V')^ < E l^l 2 < C 1 + ^} > ( 44 ) 

(3=1 

:={(!- < E ]3£ < (1 + • (45) 



Let us now assume that 

D 2 



(46) 



We then have that 

(l-e/2)^>l- e + j>l-e, (47) 

so that 

C(e/2) C {(1 - £ /2)% < V C (l - e < V (48) 

and thus 

p>l-Z)P(C c (£/2)), (49) 

where the superscript c means complement. Our goal is to find a good upper bound for 
F{C c {e/2)). 

If the event A (rf) occurs for < rf < \ then 

1 - rf < -4— <l + 2r)', (50) 
||G|| 2 

and consequently, if A (rf) fl B(rj") occurs then 



deq 



£|g„| 



2 



^(1 - V)(l - V) < < ^(1 + V)(l + V) • (51) 

It is now easy to see that A(r)')nB(r)") C C(2r)'+r)"+2r)'r)"), so if we choose 77' = 77" = e/8 
we obtain that 

^(|)n5(|)CC(f£ + ie 2 )CC(£/2) for 0<e<l. (52) 
We thus have the following upper bound: 

F(C c (e/2)) < F(A c (e/8)) + F(B c (e/8)) . (53) 
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To find an estimate of F(A(e/8)) and P(B(e/8)) we use the Large Deviations Prin- 
ciple. It is convenient to use a slightly stronger version of this principle than usual, see 
Section 2.2.1 of [3J, which states that for a sequence of N i.i.d. random variables X i; 



N y 

P (J^ ^ _ E (X l )\ > 5) < 2e- NI ^ x ^ (54) 
i=i 

where I(x) is the rate function [3J associated with the distribution of the X iy defined to 
be 

I{x) = sup(#x - logEe eXl ) . (55) 

In our case, where X, will be the square of a standard normal random variable, the rate 
function is 

I(x) = \{x — 1 — logs) \/x > 1 , (56) 

as a simple calculation shows. 
To estimate P(A(e/S)), set 

N = 2D, Xp = 2D(ReG (3 ) 2 , X D+I3 = 2D(lmGfs) 2 for (3 = 1, . . . , D . (57) 

Thus, for % = 1, . . . , 2D, the Xj are i.i.d. variables with mean EXj = 2DK(KeGi) 2 = 1; 
we thus obtain 

P(A c (e/8)) = P||||G|| 2 -1| >e/8} = (58) 

I > e/8) (59) 



P {IE 




/3=1 




2D 

'{IE 

i=l 


X 

2L> 



(60) 



< 2e - 2D/ ( 1+£ / 8 ) (61) 

= 2e -0( £ /8-log(l+ £ /8)) (Q 2 ) 

Ds 2 ^ 
192. 



<2exp(-^). (63) 



In the last step we have used that log(l + x) < x — x 2 /3 for < x < 1/2. 
We use a completely analogous argument for B, setting 

N = 2d eq , X f3 = 2D(ReGp) 2 , X D+f3 = 2D{lmG fS ) 2 , for (5 = 1, . . . , d eq , (64) 
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and obtain that 

d. 



¥(B%e/S)) = p{|£ IG,! 2 -%\/%> e/s} (65) 
/3=i 

i=i eq 

<2exp(-^). (67) 



192 

From ([53D, ([63D, and ([67D it follows that 

d eq e 2 \ ( De' 2 \ ( De 



P(C'W2)) < 2exp(-^) + 2exp(-^) < 4exp(-^) , (68) 
where we have used that d eq > D/2. Therefore, by ( 1491) . 

p>l-4Dexp(-^). (69) 

The last term converges to as D — > oo, so there exists a Dq > such that for all 
D > D , 

p>l-S, (70) 



which is what we wanted to show. In order to check this for the Dq specified in ( 1331 ) 
right after Lemma [H note that the desired relation 

4Dexp(-^-) < 5 (71) 



384 

is equivalent to 



^(ll-^l>«. (72) 



,384 D 

Thus, it suffices that D > 10V 2 log(4/5) and 

^ < 10- V . (73) 

Since logD < \[T) for all positive numbers D, condition ( 1731) will be satisfied if VD > 
10 3 £" 2 i.e., if D > 10 6 £" 4 . □ 



4 Examples of systems that do not approach ther- 
mal equilibrium 

We shall now present examples of atypical behavior, namely examples of "bad" Hamilto- 
nians, i.e., Hamiltonians for which not all wave functions approach thermal equilibrium 
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(or, equivalently, for which (1241 is not satisfied). According to Theorem [H bad Hamil- 
tonians form a very small subset of the set of all Hamiltonians. Of course, to establish 
that ([21]) holds for a particular Hamiltonian can be a formidable challenge. Moreover, 
the small subset might include all standard many-body Hamiltonians (e.g., all those 
which are a sum of kinetic and potential energies). But there is no a priori reason to 
believe that this should be the case. 

The first example consists of two non-interacting subsystems. This can be expressed 
in the framework provided by Example 1 in Section 11.21 with the Hamiltonian H = 
Hi + H 2 + XV by setting A = 0. Let {<$]} be an orthonormal basis of Jifi consisting of 
eigenvectors of H\ with eigenvalues E}, and {(f]} one of J^ 2 consisting of eigenvectors 
of H 2 with eigenvalues E?. Clearly, for A = not every wave function will approach 
thermal equilibrium. After all, in this case, the <p\ (g) 0| form an eigenbasis of H, while 

JT = spanj^ 1 ® 4>) : E\ + E] e[E,E + 5E] } (74) 

and 

Jif eq = span{</>! ® 0| : E\ G [0A9E, 0.5LE) and E\ + E) e[E,E + 5E] } . (75) 

Thus, any <j>\ ® such that E\ + E] e [E,E + 5E] but, say, E\ < 0A9E, will be an 
example of an element of Jif that is orthogonal to Jif eq and, as it is an eigenfunction of 
H, forever remains orthogonal to J#iq- 

As another example, we conjecture that some wave functions will fail to approach 
thermal equilibrium also when A is nonzero but sufficiently small. We prove this now 
for a slightly simplified setting, corresponding to the following modification of Example 
1 of Section [L2l For the usual energy interval [E, E + SE], let J4? be, independently of 
A, given by (174]) . and, instead of Hi + H 2 + XV, let H be given by 

H = H(X) = P(Hi + H 2 + XV)P , (76) 

where P is the projection to Jtif. Then H defines a time evolution on ffl that depends 
on A. (Note that J^ 5 is still an "energy shell" for all sufficiently small A, as all nonzero 
eigenvalues of H (A) are still contained in an interval just slightly larger than [E, E + 5E], 
and the corresponding eigenvectors lie in Jif.) Let Jf eq for A 7^ also be given by (J75]l . 
Again, choose one particular (p\ and one particular 0^ (independently of A) so that 
E\ + Ej e [E,E + 5E] and E\ < 0A9E, and consider as the initial state of the system 
again 

V(* = 0) = #®$, (77) 

which evolves to 

i/j(X,t) = e-^Vi ® $ • (78) 

Suppose for simplicity that H(X = 0) = Hi + H 2 is non-degenerate0 Then, according to 
standard results of perturbation theory [7j, also H(X), regarded as an operator on Jif, 

4 Since this requires that no eigenvalue difference of H\, Ej — Ej,, coincides with an eigenvalue 
difference of H2 , E? — E?, , we need to relax our earlier assumption that system 1 and system 2 be 
identical; so, let them be almost identical, with slightly different eigenvalues, and let Hi and H2 each 
be non-degenerate. 
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is non-degenerate for all A G (— A , A ) for some A > 0; moreover, its eigenvalues -E"(A) 
depend continuously (even analytically) on A, and so do the eigenspaces. In particular, 
it is possible to choose for every A G (— A , A ) a normalized eigenstate 0(A) G ffl of 
H(\) with eigenvalue E(X) in such a way that 0(A) and E(X) depend continuously on 
A, and 0(A = 0) = <j>\ <g> </yj. 

We are now ready to show that for sufficiently small A > 0, 

<VKA,t)|P e ^(A,t)>^0 (79) 

for all t; that is, ip(X, t) is nearly orthogonal to J$? eq for all t, and thus is never in thermal 
equilibrium. To see this, note first that since 0(0) ~ 0(A) for sufficiently small A, and 
since e~ lH ^ 1 is unitary, also 

e- iHWt <J)(0) « e- iH{x)t (j)(X) (80) 
(with error independent of t). Since the right hand side equals 

e- iE(x)t (j)(\) « e- iE{x)t (j)(0) , (81) 

we have that 

(e-^ A )V(O)|P eg |e- iH(A)i 0(O)) « (0(O)|P e(? |0(O)> = . (82) 

This proves (|79l) with an error bound independent of t that tends to as A — > 0. 

Another example of "bad" Hamiltonians is provided by the phenomenon of Anderson 
localization (see in particular [UlU]): Certain physically relevant Hamiltonians possess 
some eigenf unctions Q that have a spatial energy density function that is macroscopi- 
cally non-uniform whereas wave functions in J^ eq should have macroscopically uniform 
energy density over the entire available volume. Thus, some eigenf unctions are not close 
to J^ eq , violating (J23D- 



5 Comparison with classical mechanics 

In classical mechanics, one would expect as well that a macroscopic system spends 
most of the time in the long run in thermal equilibrium. Let us define what thermal 
equilibrium means in classical mechanics. (We defined it for quantum systems in (jSJ).) 
We denote a point in phase space by X = (q 1 , . . . , q N , p 1; . . . , p N ). Instead of the 
orthogonal decomposition of M 3 into subspaces J% we consider a partition of an energy 
shell T in phase space, T = {X : E < H(X) < E + 5E}, into regions T u corresponding 
to different macro-states u, i.e., if the micro-state X of the system is in r„ then the 
macro-state of the system is v. It has been shown |8J for realistic systems with large 
iV that one of the regions T u , corresponding to the macro-state of thermal equilibrium 
and denoted T eq , is such that, in terms of the (uniform or Liouville) phase space volume 
measure \x on T, 

nit- \ 

1 . (83) 
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Though the subspaces J^, play a role roughly analogous to the regions T u , a basic 
difference between the classical and the quantum cases is that while every classical 
phase point in T belongs to one and only one T u , and thus is in one macro-state, a 
quantum state if) need not lie in any one Jif u , but can be a non-trivial superposition of 
vectors in different macro-states. (Indeed, almost all if) do not lie in any one 3^ v . That 
is why we defined being in thermal equilibrium in terms of if) lying in a neighborhood 
of rather than lying in itself.) 

The time evolution of the micro-state X is given by the solution of the Hamiltonian 
equations of motion, which sends X (at time 0) to X t (at time t), t G 1. We expect that 
for realistic systems with a sufficiently large number N of constituents and for every 
macro-state u, most initial phase points X G T v will be such that X t spends most of 
the time in the set T eq . This statement follows if the system is ergodicp but in fact is 
much weaker than ergodicity. Theorem [T] is parallel to this statement in that it implies, 
for typical Hamiltonians, that initial states (here, if)(0)) out of thermal equilibrium will 
spend most of the time in thermal equilibrium; it is different in that it applies, for typical 
Hamiltonians, to all, rather than most, initial states if>(0). 

6 Comparison with the literature 

Von Neumann [17] proved, as his "quantum ergodic theorem," a precise version of 
normal typicality (defined in Section 12.21) ; his proof requires much more effort, and 
more refined methods, than our proof of Theorem 1. However, his theorem assumes 
that the dimension d u of each macro-space 3^ v is much smaller than the full dimension 
D, and thus does not apply to the situation considered in this paper, in which one of the 
macro-spaces, J4? eq , has the majority of dimensions. The reason von Neumann treated 
the more difficult case of small d v but left out the easier and particularly interesting 
case of the thermal equilibrium macrostate is that he had in mind a notion of thermal 
equilibrium different from ours. He thought of a thermal equilibrium wave function if>, 
not as one in (or close to) a particular J$? u , but as one with HP^H 2 »s d u /D for every 
v, i.e., one for which \ip)(ip\ ~ p mc in a suitable coarse-grained sense. Because of this 
different focus, he did not consider the situation presented here. We also note that von 
Neumann's quantum ergodic theorem makes an assumption on H that we do not need in 
our Theorem 1; this assumption, known as a "no resonances" [HI IS] or "non-degenerate 
energy gaps" [10] condition, asserts that 



The Schnirelman Theorem ]2] states that, in the semi-classical limit and under suit- 
able hypotheses, the Wigner distribution corresponding to an eigenstate (f) a becomes the 

5 A classical system is ergodic if and only if the time evolved micro-state Xt spends, in the long run, 
a fraction of time in each (measurable) set B CT that is equal to /j,(B)/fi(T) for /i-almost all X. 




(84) 
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micro-canonical measure. That is, the <\> a have a property resembling thermal equilib- 
rium, similar to our condition (|24l) expressing that all eigenstates are in thermal equilib- 
rium. Srednicki [T5] observed other thermal equilibrium properties in energy eigenstates 
of example systems, a phenomenon he referred to as "eigenstate thermalization." 

The results of [T6l [T2| [TO] also concern conditions under which a quantum system 
will spend most of the time in "thermal equilibrium." For the sake of comparison, their 
results, as well as ours, can be described in a unified way as follows. Let us say that a 
system with initial wave function ^(0) equilibrates relative to a class srf of observables 
if for most times r, 

{iI){t)\A\iI){t)) « T±(\ij)(t))($(t)\A\ for ail A est. (85) 

We then say that the system thermalizes relative to srf if it equilibrates and, moreover, 

Tr (V(t))(?Kt)|A) « Tr (p mc A) for allied, (86) 

with p mc the micro-canonical density matrix (in our notation, 1/D times the projection 
P to Jf?) . With these definitions, the results of [161 H21 UU] can be formulated by saying 
that, under suitable hypotheses on H and i/j(0) and for large enough D, a system will 
equilibrate, or even thermalize, relative to a suitable class 

Our result is also of this form. We have just one operator in =sz/, namely P eq . We 
establish thermalization for arbitrary "0(0) assuming H is non-degenerate and satisfies 
((p a \Peq\4>a) ~ 1 for all a, which (we show) is typically true. 

Von Neumann's quantum ergodic theorem [IT] establishes thermalization for a fam- 
ily s$ of commuting observables; s# is the algebra generated by {M 1; . . . , M&} in the 
notation of Section 11.11 He assumes that the dimensions of the joint eigenspaces 
are not too small and not too large; that H obeys ( l84l) ; he makes an assumption about 
the relation between H and the subspaces Jtl that he shows is typically true; and he 
admits arbitrary ip(0). See [5] for further discussion. Rigol, Dunjko, and Olshanii [T3] 
numerically simulated an example system and concluded that it thermalizes relative to 
a certain class consisting of commuting observables. 

Tasaki [16] as well as Linden, Popescu, Short, and Winter [10J consider a system 
coupled to a heat bath, J^otai = ^sys ® ^bath, and take ssi to contain all operators 
of the form A sys ® lbath- Tasaki considers a rather special class of Hamiltonians and 
establishes thermalization assuming that 

max|(0 a |^(O))| 2 < 1, (87) 

a 

a condition that implies that many eigenstates of H contribute to ip(0) appreciably and 
that can (more or less) equivalently be rewritten as 

^|(0 a |^(O))| 4 «l. (88) 
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Under the assumption fl88|) on ?/>(0), Linden et al. establish equilibration for H satisfying 
dH3J) • They also establish a result in the direction of thermalization under the additional 
hypothesis that the dimension of the energy shell of the bath is much greater than 
dim^ ys . 

Reimann's mathematical result [12] can be described in the above scheme as follows. 
Let srf be the set of all observables A with (possibly degenerate) eigenvalues between 
and 1 such that the absolute difference between any two eigenvalues is at least (say) 
10~ 1000 . He establishes equilibration for H satisfying (1841) . assuming that ip(0) satisfies 

flEED. 
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